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AB A method for performing network reconstruction form microarray 

hybridization data is described. The method is used to identify 
genes showing coordinated regulation or membership of the same 
regulatory network. The method includes the steps of selecting a 
predictor set of features, adding a 
complement to the predictor set based on 

a quality of prediction, checking to see if all of the features of 

the predictor set are repeated, and then 

removing one feature from the predictor set. 

The algorithm and method repeats the steps of adding a 

complement, checking the predictor set 

and removing a feature until the features of the predictor 
set are repeated. If the features of the predictor 
set are repeated, the algorithm and method terminate. 
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bio-informatics, involves adding complement 

to selected predictor set based 

on quality of prediction. 

B04 D16 S03 T01 T02 

KANEVSKY, V; VAILAYA, A 

(KANE- I) KANEVSKY V; (VAIL-I) VAILAYA A 
1 



Searcher : Shears 308-4994 



09/779240 



PATENT NO KIND DATE WEEK LA PG 



US 2002147546 Al 20021010 (200323)* 17 
APPLICATION DETAILS: 

PATENT NO KIND APPLICATION DATE 



US 2002147546 Al US 2001-779240 20010208 



PRIORITY APPLN. INFO: US 2001-779240 20010208 

AN 2003-237981 [23] WPIDS 

AB US2002147546 A UPAB: 20030407 

NOVELTY - Determination (Ml) of predictor set of 

features associated with targets comprising selecting a 

predictor set of features, adding a 

complement ( s ) in the predictor set based 

on a quality of prediction, and checking to see if all of the 
features are repeated, and removing a feature from the 
predictor set, is new. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included 

for: 

(1) network reconstruction comprising selecting a target, 
selecting a predictor set of features, adding a 

complement to the predictor set based on 

a quality of prediction, checking to see if all of the features are 

repeated, and removing one feature from the predictor 

set; 

(2) classification of experiments, comprising selecting a 
target whose values are chosen to represent the class of the 
experiment, and performing the network reconstruction; 

(3) a computer software for performing network reconstruction, 
comprising an algorithm; and 

(4) a system for performing network reconstruction comprising ; 
computer and a computer software for running on the computer, the 
computer software performing network reconstruction. 

USE - (Ml) is useful for determining predictor 
set of features associated with a target useful in the field 
of bio-informatics. It can also be used in information, computer, 
software and data processing systems. 

ADVANTAGE - (Ml) is capable of identifying a good subset of 
genes that can be predicted the target vector. Hence, these genes 
can form a diagnostic set of genes whose expression values are used 
to discriminate between the various types of leukemia. 

DESCRIPTION OF DRAWING (S) - The. figure shows a schematic view 
of the invention in vector format illustrating the target and the 
predictors . 
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dxffrcult and treat^n?!""^^;^^ therapeutic decisions 
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AB Precise estimates of past temperatures are critical for 

understanding the evolution of organisms and the physical biosphere, 
and data from continental areas are an indispensable 
complement to the marine record of stable isotopes. Climate 
is considered to be a primary selective force on leaf morphology, 
and two widely used methods exist for estimating past mean annual 
temperatures from assemblages of fossil leaves. The first approach, 
Leaf Margin Analysis, is univariate, based on the positive 
correlation in modern forests between mean annual temperature and 
the proportion of species in a flora with untoothed leaf margins. 
The second approach, known as the Climate-Leaf Analysis Multivariate 
Program, is based on a modem data set that is multivariate. I argue 
here that the simpler, univariate approach will give 
paleotemperature estimates at least as precise as the multivariate 
method because (1) the temperature signal in the multivariate data 
set is dominated by the leaf-margin character; (2) the additional 
characters add minimal statistical precision and in practical use do 
not appear to improve the quality of the estimate; (3) the predictor 
samples in the univariate data set contain at least twice as many 
species as those in the multivariate data set; and (4) the presence 
of numerous sites in the multivariate data set that are both dry and 
extremely cold depresses temperature estimates for moist and 
nonfrigid paleofloras by about 2 degree C, unless the dry and cold 
sites are excluded from the predictor set. New 

data from Western Hemisphere forests are used to test the univariate 
and multivariate methods and to compare observed vs. predicted error 
distributions for temperature estimates as a function of species 
richness. Leaf Margin Analysis provides excellent estimates of mean 
annual temperature for nine floral samples. Estimated temperatures 
given by 16 floral subsamples are very close both to actual 
temperatures and to the estimates from the samples. Temperature 
estimates based on the multivariate data set for four of the 
subsamples were generally less accurate than the estimates from Leaf 
Margin Analysis. Leaf -margin data from 4 5 transect collections 
demonstrate that sampling of low-diversity floras at extremely local 
scales can result in biased leaf-margin percentages because species 
abundance patterns are uneven. For climate analysis, both modern and 
fossil floras should be sampled over an area sufficient to minimize 
this bias and to maximize recovered species richness within a given 
climate . 
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AB Genes that are differentially expressed in normal and 

diseased tissue are identified for use in the diagnosis and 
prognosis of disease such as cancer, esp. pleural mesothelioma. 
Sets of genes that are expressed differentially in 
malignant pleural mesothelioma are reported. These sets of 
genes can be used to discriminate between normal and 
malignant tissues, and between classes of malignant tissues. 
Accordingly, diagnostic assays for classification of tumors, 
prediction of tumor outcome, selecting and monitoring 
treatment regimens and monitoring tumor progression/regression also 
are provided. 



Searcher 



Shears 308-4994 



09/779240 



L9 ANSWER 2 OF 13 
ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 



AUTHOR (S) 



CORPORATE SOURCE: 



SOURCE : 



PUBLISHER: 
DOCUMENT TYPE 
LANGUAGE : 
AB 



HCAPLUS COPYRIGHT 2003 ACS 
2003:37277 HCAPLUS 
138:233653 

Structural modeling of ataxin-3 reveals distant 
homology to adaptins 

Albrecht, Mario; Hoffmann, Daniel; Evert, Bernd 
0.; Schmitt, Ina; Wullner, Ullrich; Lengauer, 
Thomas 

.Fraunhofer Institute for Algorithms and 
Scientific Computing (S'CAI), Sankt Augustin, 
53754, Germany 

Proteins: Structure, Function, and Genetics 
(2003), 50(2), 355-370 
CODEN: PSFGEY; ISSN: 0887-3585 
Wiley-Liss, Inc. 
Journal 
English 

Spinocerebellar ataxia type 3 (SCA3) is a polyglutamine disorder 
caused by a CAG repeat expansion in the coding region of a 
gene encoding ataxin-3, a protein of yet unknown 

function. Based on a comprehensive computational anal., we propose 
a structural model and structure-based functions for ataxin-3. Our 
predictive strategy comprises the compilation of multiple 
sequence and structure alignments of carefully selected 
proteins related to ataxin-3. These alignments are 
consistent with addnl.' information on sequence motifs, secondary 
structure, and domain architectures. The application of 
complementary methods revealed the homol . of 
ataxin-3 to ENTH and VHS domain proteins involved in 
membrane trafficking and regulatory adaptor functions. We modeled • 
the structure of ataxin-3 using the adaptin AP180 as a template and 
assessed the reliability of the model by comparison with known 
sequence and structural features. We could further infer 
potential functions of ataxin-3 in agreement with known exptl. data. 
Our database searches also identified an as yet uncharacterized 
family of proteins, which we named josephins because of 
their pronounced homol. to the Josephin domain of ataxin-3. 
REFERENCE COUNT: 185 THERE ARE 185 CITED REFERENCES AVAILABLE 

FOR THIS RECORD. ALL CITATIONS AVAILABLE 
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AB The mycobacterial insertion sequence IS.6110 has been exploited 
extensively as a clonal marker in mol. epidemiol. studies 
of tuberculosis. In addn., it has been hypothesized that this 
element is an important driving ' force behind genotypic variability 
that may have phenotypic consequences. The authors present here a 
novel, DNA microarray-based methodol . , designated 

SiteMapping, that simultaneously maps the locations and orientations 
of multiple copies of IS6110 within the genome. To investigate the 
sensitivity, accuracy, and limitations of the technique, 
it was applied to eight Mycobacterium tuberculosis strains for which 
complete or partial IS6110 insertion site information had been detd. 
previously. SiteMapping correctly located 64% (38 of 59) of the 
IS6110 copies predicted by restriction fragment length 
polymorphism anal. The technique is highly specific; 97% 
of the predicted insertion sites were true insertions. 
Eight previously unknown insertions were identified and confirmed by 
PCR or sequencing. The performance could be improved by 
modifications in the exptl. protocol and in the approach to data 
anal. SiteMapping has general applicability and demonstrates an 
expansion in the applications of microarrays that 
complements conventional approaches in the study of genome 
architecture . 

REFERENCE COUNT:, 32 THERE ARE 32 CITED REFERENCES AVAILABLE 

FOR THIS RECORD. ALL CITATIONS AVAILABLE 
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AUTHOR(S): Orfao, Alberto; Chillon, Maria Carmen; 

Bortoluci, Aglae M . ; Lopez-Berges, Maria 
Consuelo; Garcia-Sanz, Ramon; Gonzalez, Marcos; 
Tabernero, Maria Dolores; Garcia-Marcos, Maria 
Antonia; Rasillo, Ana Isabel; Hernandez-Rivas, 
Jesus; Miguel, Jesus F. San 

CORPORATE SOURCE: Servicio General de Citometna, Centro de 

Investigaciones del Cancer de Salamanca and 
Departamento de Medicina, Universidad de 
Salamanca, Salamanca, Spain 

SOURCE: Haematologica (1999), 84(5), 405-412 

CODEN: HAEMAX; ISSN: 0390-6078 

PUBLISHER: II Pensiero Scientifico Editore 

DOCUMENT TYPE:' Journal 

LANGUAGE: English 

AB Rapid identification of AML patients carrying the t(15;17) 

translocation for treatment decision-making is currently made on the 
basis of morphol. screening. However, the existence of both false 
positives and negatives highlights the need for more objective 
methods of screening AML cases and further mol. confirmation 
of the t(15;17) translocation. Here, a total of 111 AML cases were 
analyzed to investigate whether immunophenotyping based on the 



Searcher 



Shears 308-4994 



09/779240 



assessment of multiple-stainings analyzed at flow cytometry could 
improve the sensitivity and specificity of morphol. identification 
of acute promyelocytic leukemia (APL) carrying the t(15;17) 
translocation. FISH anal, was used as a complementary 
technique for cases in which morphol. and mol. biol . yielded 
discrepant results. Concordant results between morphol. and RT-PCR 
were found in 102/111 (91.8%) cases: 34 patients had 
M3/PML-RAR. alpha. + and 68 non-M3/PML-RAR. alpha . - disease. Nine 
cases showed discrepants results. Multivariate anal, showed that 
the best combination of immunol . markers for 

discriminating between M3/PML-RAR. alpha . + and non-M3/PML-RAR. alpha . - 
cases was that of the presence of heterogeneous expression of CD13, 
the existence of a single major blast-cell population, and a 
characteristic CD34/CD15 phenotypic pattern. A score system 
based on these parameters was designed, and the 34 
M3/PML-RAR. alpha. + cases showed a score of 3 (presence of the 3 
. phenotypic characteristics) . In contrast, only 1 out of 

the 68 (1.3%) non-M3/PML-RAR. alpha.- cases had this score, most of 
these latter cases (53/68, 78%) scoring either 0 or 1 . Therefore, 
among these cases, immunophenotyping showed a sensitivity of 100% 
and a specificity of 99% for predicting PML/RAR. alpha . 
gene rearrangements. Of the 9 cases in which morphol. and 
mol. biol. results were discrepant, four cases displayed M3 morphol. 
without PML/RAR. alpha, rearrangements by RT-PCR. In only one of 
these 4 cases did the immunophenotype score 3, this being the only 
FISH-pos. case. From the remaining five discrepant cases (non-M3 
morphol. while pos . for PML/RAR. alpha .) , two cases had a phenotypic 
score of 3 and were FISH-pos., whereas the other three were neg. by 
FISH. Upon repeating RT-PCR studies, two of these latter 
three cases became neg. Thus, immunophenotyping may be of great 
value for quick screening of APL with PML/RAR. alpha . rearrangements. 
REFERENCE COUNT: 4 2 THERE ARE 42 CITED REFERENCES AVAILABLE 

FOR THIS RECORD. ALL CITATIONS AVAILABLE 

IN THE RE FORMAT 
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TITLE: Molecular cloning and characterization .of a 

novel human diacylglycerol kinase .zeta. 
AUTHOR(S) : Bunting, Michaeline; Tang, Wen; Zimmerman, Guy 

A.; Mclntyre, Thomas M.; Prescott, Stephen M. 
CORPORATE SOURCE: Eccles Program Human Mol. Biol. Genet., Univ. 

Utah, Salt Lake City, UT, 84112, USA 
SOURCE: Journal of Biological Chemistry (1996), 271(17), 

10230-6 

CODEN: JBCHA3; ISSN: 0021-9258 
PUBLISHER: American Society for Biochemistry and Molecular 

Biology 

DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB Diacylglycerol (DAG) occupies a central position in the synthesis of 
complex lipids and also has important signaling roles. For example, 
DAG is an allosteric regulator of protein kinase C, and 
the cellular levels of DAG may influence a variety of processes 
including growth and differentiation. The authors previously 
demonstrated that human endothelial cells derived from umbilical 
vein express growth-dependent changes in their basal levels of 
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kn* ,nut -K.zeta.) with a predir.^ ™ V 9 nov el • diacylgl vcerol 
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for this portion of the choriogonadotropin receptor is 
complementary in shape and electrostatic 

characteristics to the surface of hCG at an identified focus 
of hormone-receptor interaction. The predicted models for 
the structures and mode of hormone binding of the glycoprotein 
hormone receptors are to a large extent consistent with currently 
available biochem. and mutational data. Repeated 
sequences in . beta . -barrel proteins are shown to have 
general implications for constraints on structure. Averaging 
techniques used here to recognize the structural motif in 
these receptors should also apply to other proteins with 
repeated sequences. 
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TITLE: Identification of interactive sites of 

proteins and protein receptors 
by computer-assisted searches for 
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sequences 

AUTHOR(S) : Fassina, Giorgio; Melli, Marialuisa 

CORPORATE SOURCE: Protein Engineering, TECNOGEN S.C.p.A., Caserta, 

81015, Italy 

SOURCE: ImmunoMethods (1994), 5(2), 114-20 

CODEN: IMUME8; ISSN: 1058-6687 
DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB Protein sites important for receptor binding have been 
identified in several systems by searching for protein 
/receptor stretches characterized by hydropathic anti- 
complementarity. A computer-assisted method 
[SITESEARCH] has been developed to identify protein sites 
responsible for receptor recognition, once the amino acid sequences 
of the protein ligand and its receptor are available. The 
software first dets. the hydropathic profiles of the two 
polypeptide chains under investigation, and then compares 
profiles of preselected length, detg. at the same time the degree of 
; their hydropathic complementarity. The procedure is 
repeated until all the different segments in the two chains 
are compared. Fragments characterized by the maximal level of 
hydropathic complementarity are selected as putative 
binding sites. This approach has been initially applied to the 
interleukin-l.beta. ( IL-1 .beta .) /receptor type I case. SITESEARCH 
identified residues 88-99 in IL-1. beta, and 151-162 in the receptor 
as the sequence pair characterized by the maximal level of 
hydropathic complementarity. These fragments, once 
produced by chem. synthesis, have displayed specific recognition 
properties for each other, as detected by solid-phase binding 
assays. The IL-1. beta, sequence identified corresponds to a highly 
exposed part of the protein mol . , and substitution of this 
sequence with another of the same length but with different 
hydropathic characteristics generated mutants with 

drastically reduced binding activity to the receptor. Mutations in 
this sequence did not alter the protein biol. activity, 
thus suggesting the structural integrity of the mutants. 
Cumulatively, these results validate the SITESEARCH 
prediction, suggesting that IL-1. beta, sequence 88-99 is 
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involved in at least a portion of the protein/receptor 
binding site. 
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TITLE: Chicken filensin: A lens fiber cell 

protein that exhibits sequence 
similarity to intermediate filament 
proteins 

AUTHOR (S) : Remington, Susann G. 

CORPORATE SOURCE: Dep. Genet. Cell Biol., Univ. Minnesota, St 

Paul, MN, 55108, USA 
SOURCE: Journal of Cell Science (1993), 105(4), 1057-68 

CODEN: JNCSAI; ISSN: 0021-9533 
DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB Filensin, a 100 kDa, membrane-assocd. , cytoskeletal protein 

, is uniquely expressed in the lens fiber cell (Merdes, A., et al., 
1991, J. Cell Biol. 115: 397-410). The author cloned and sequenced 
a full-length chicken lens cDNA encoding filensin, also known as 
CP95' (Ireland, M. and Maisel, H. (1989) Lens and Eye Toxicity 
Research 6, 623-638) . The deduced amino acid sequence of 657 
residues contained an internal 280 residue heptad repeat 
domain with sequence similarities to the rod domain of ■ intermediate 
filament proteins. The putative filensin rod domain could 
be divided into three \ alpha . -helical segments (1A, IB and 2) sepd. 
by short, non-helical linkers. The sequence of the amino-terminal 
end of the filensin rod domain contained the highly conserved 
intermediate filament segment 1A motif (Conway, J. F. and Parry, D. 
A. D. (1988) Int. J. Biol. Macromol. 10, 79-98). Allowing 
conservative amino acid substitutions, the sequence of the 
carboxy-terminal end of the filensin rod domain was similar to that 
of the highly conserved intermediate filament rod carboxy terminus. 
. The . alpha. -helical segments of the shorter filensin rod domain 
aligned with the corresponding segments of intermediate filament 
proteins by allowing a gap of four heptad repeats 
in the amino-terminal half of filensin segment 2. Filensin rod 
segment 2 contained the characteristic stutter in heptad 
repeat phasing, nine heptads from the end of the 

intermediate filament ro.d. The overall sequence identity between 
the rod domains of filensin and individual intermediate filament 
proteins was 20 to 25%, approx. the level of sequence 
identity obsd. between intermediate filament proteins of 
different types. The open reading frame of chicken filensin 
predicted a 657 amino acid protein with mol. mass 
of 7 6 kDa. Embryonic chicken filensin migrated in SDS-PAGE as a 
triplet of 102, 105 and 109 kDa, while rooster filensin migrated as 
a 105 and 109 kDa doublet. Antibodies to filensin labeled lens 
fiber cells but not lens epithelial cells. By immunofluorescence 
methods filensin was localized to the fiber cell plasma 
membranes, including the ends of elongated fiber cells. 
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AUTHOR (S) : Fazioli, Francesca; Minichiello, Liliana; 

Matoskova, Brona; Wong, William T.; Di Fiore, 
Piere Paolo 

CORPORATE SOURCE: Lab. Cell. Mol . Biol., Natl. Cancer Inst., 

Bethesda, MD, 20892, USA 
SOURCE: Molecular and Cellular Biology (1993), 13(9), 

5814-28 

CODEN: MCEBD4; ISSN: 0270-7306 
DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB An expression cloning method which allows direct isolation 

of cDNAs encoding substrates for tyrosine kinases was applied to the 

study of the epidermal growth factor (EGF) receptor (EGFR) signaling 

pathway. A previously undescribed cDNA was isolated and designated 

epslS. The structural features of the predicted 

eps!5 gene product allow its subdivision into 3 domains. 

Domain I contains signatures of a regulatory domain, including a 

candidate tyrosine phosphorylation site and EF-hand-type 

calcium-binding domains. Domain II presents the 

characteristic heptad repeats of coiled-coil 

rod-like proteins, and domain III displays a 

repeated aspartic acid-proline-phenylalanine motif similar 

to a consensus sequence of several methylases. Antibodies specific 

for the eps!5 gene product recognize 2 proteins: 

a major species of 142 kDa and a minor component of 155 kDa, both of 
which are phosphorylated on tyrosine following EGFR activation by 
EGF in vivo. * EGFR is also able to directly phosphorylate the epslS 
product in vitro. In addn., phosphorylation of the epslS 
gene product in vivo is relatively receptor specific, since 
the, erbB-2 kinase phosphorylates it very inefficiently. Finally, 
overexpression of epslS is sufficient to transform NIH 3T3 cells, 
thus suggesting that the epslS gene product is involved in 
the regulation of mitogenic signals. 
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TITLE: Dinucleotide repeat polymorphism at 

the D11S982E locus 
AUTHOR(S): Xiao, Hong; Ide, Susan E.; Merril, Carl R.; 

Polymeropoulos, Mihael H. 
CORPORATE SOURCE: Neurosci. Cent., St. Elizabeths Hosp., 

Washington, DC, 20032, USA 
SOURCE: Human Molecular Genetics (1993), 2(7), 1081 

CODEN: HMGEE5; ISSN: 0964-6906 
DOCUMENT TYPE: Journal 
LANGUAGE: English' 

AB A (AC)n dinucleotide repeat sequence was isolated from a 

human fetal retinal cDNA library by hybridization with a synthetic 
poly (dC-dA) . cntdot . (dG-dT) sequence. The insert of the polymorphic 
clone SE68 contg. the dinucleotide repeat was sequenced by 
the dideoxy method. The predicted length of the 

fragment amplified by PCR primers derived from the insert sequence 
was 121 bp. The frequencies of 5 alleles were estd. from 120 
chromosomes of unrelated individuals. The dinucleotide 
repeat marker was assigned to chromosome 11 using 

2 human-rodent somatic cell hybrid panels. Co-dominant segregation 
was obsd. in 5 informative families. 
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designated SE60. The insert contg. the dinucleotide repeat 
was sequenced using the dideoxy method. The 
predicted length of the amplified fragment was 150 bp 
(GenBank accession no. L04461) . 9 Alleles were detected in 142 
chromosomes of unrelated individuals. The dinucleotide 
repeat marker was assigned to chromosome 14 using 
two human-rodent somatic cell hybrid panels. Co-dominant 
segregation was obsd. in five informative families. 
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SOURCE: Proceedings of the National Academy of Sciences 
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AB The enhanced graphic matrix procedure analyzes nucleic 
acid and amino acid sequences for features of possible 
biol. interest and reveals the spatial patterns of such 
features. When a sequence is compared to itself, the 
technique shows regions of self -complementarity, 
direct repeats, and palindromic subsequences. Comparison 
of 2 different sequences, exemplified by Ig .kappa, light chain 
genes, by using colored graphic matrices showed domains of 
similarity, regions of divergence, and features 

explainable by transpositions. Anal, of mouse const, domain Ig 

sequences revealed self -complementary regions that can be 

used to fold the mol. into a structure consistent with electron 

microscopic observations. Computer translation of nucleic 

acid sequences into all possible amino acid sequences, followed by 

graphic matrix anal., provides a way to detect the most likely 

protein encoding regions and can predict the 

correct reading frames in sequences in which splicing patterns are 
not defined. Application of this technique to regions of 
simian virus 40 and polyoma virus demonstrates the frames of 
translation and shows the agreement of sequences detd. in sep. labs, 
with different virus isolates. The graphic matrix technique 
can also be used to assemble fragmentary sequences during detn., to 
display local variations in base compn., to detect distant 
evolutionary relations, and to display intragenic variation in rates 
of evolution. 
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SOURCE: Global Cosmetic Industry, (Jan 2003) Vol. 171, No. 1 

pp. 37(14) . 

ISSN: ISSN: 1523-9470. 
Allured Publishing Corp. 
Newsletter 
English 
5685 

*FULL TEXT IS AVAILABLE IN THE ALL FORMAT* 
Whether buying in a niche or mass market, store or salon, 
consumers want their hair to be an- outward expression of themselves 
And they are looking to you to help them scream style with healthy, 
shiny, sophisticated hair. 

THIS IS THE FULL TEXT: COPYRIGHT 2003 Advanstar Communications, 
Inc . 



PUBLISHER: 
DOCUMENT TYPE 
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AB 



Subscription: $40.00 per year .. Published monthly. 362 S. 
Rd., Carol Stream, IL 60188-2787. FAX 708-653-2192. 
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INVENTOR (S) : 
PATENT ASSIGNEE (S) : 

COUNTRY COUNT: 
PATENT INFORMATION: 



System for dispensing nanoliter sized droplets in 

defined distribution pattern to form miniarrays 

comprises print head with pipette-based dispensers, 

robotic arm for carrying print head and working 

platform. 

B04 D16 P75 Q39 

SHAFER, D A 

(SHAF-I) SHAFER D A; (GENE-N) GENETAG TECHNOLOGY 

INC 

87 



PATENT NO KIND DATE 



WEEK 



LA PG 



WO 2002040634 A2 20020523 (200253)* EN 63 

RW: AT BE CH CY DE DK EA ES FI FR GB GH GM GR IE IT KE LS LU MC 
MW MZ NL OA PT SD SE SL SZ TR TZ UG ZW 
W: AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI 
GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR 
LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI 
SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW 
US 2002074342 Al 20020620 (200253) 
AU 2002035128 A 20020527 (200261) 



APPLICATION DETAILS: 
PATENT NO KIND 



APPLICATION 



DATE 



WO 2002040634 A2 

US 2002074342 Al Provisional 

AU 2002035128 A 



WO 2001-US43918 
US 2000-248247P 
US 2001-992516 
AU 2002-35128 



20011114 
20001114 
20011114 
20011114 



FILING DETAILS: 



PATENT NO KIND 



PATENT NO 



AU 2002035128 A Based on 



WO 200240634 



PRIORITY APPLN. INFO: US 2000-248247P 20001114; US 2001-992516 

20011114 

AN 2002-500214 [53] WPIDS 

AB WO 200240634 A UPAB: 20020820 

NOVELTY - System (S) for dispensing nanoliter-sized droplets on 
surface in precise pattern of non-overlapping spots to form 
two-dimensional miniarray assay comprising spotter device (I) with 
print head (la) and pipette-based dispensers (lb), a 
robotic/mechanical arm carrying (la) and working platform for 
holding miniarray substrates or loading samples, is new. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included 

for: 

(1) forming (Ml) a miniarray where each known location or spot 
in the miniarray contains an analyte specific reagent for detecting 
an analyte in a sample involves: 

(a) aspirating a solution of each analyte specific reagent with 

(lb), 

(b) pressuring a small defined droplet of analyte specific 
reagent from the narrow opening of the tip of (lb), 
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COUNTRY COUNT: 22 

PATENT INFORMATION: 

PATENT NO KIND DATE WEEK LA PG 
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APPLICATION DETAILS: 

PATENT NO KIND APPLICATION DATE 



WO 2002031170 Al WO 2001-US42529 20011005 



PRIORITY APPLN. INFO: US 2000-239547P 20001010 

AN' 2002-583343 [62] WPIDS 

AB WO 200231170 A UPAB: 20020926 

NOVELTY - Incorporating (Ml) a nucleic acid sequence 
encoding adenovirus into vector backbone, comprising introducing a 
first adenoviral nucleic acid sequence (AS) (SI) encoding 
wild-type, mutant or replication-defective adenovirus, and linear 
vector having second (S2) and third (S3) ASs and a segment (S4) of 
non-AS, into a host cell, where homologous recombination of SI with 
S2 and S3 results in vector with S4 and SI, is new. 

DETAILED DESCRIPTION - Incorporating (Ml) a nucleic 
acid sequence encoding adenovirus into vector backbone, comprising 
introducing SI and a linear vector (I) having second (S2) and third 
(S3) adenoviral nucleic acid sequences and a segment of 
non-adenoviral nucleic acid sequence (S4) into a host 
cell, where SI undergoes homologous recombination with S2 and S3 in 
the linear vector resulting in a vector comprising both the 
non-adenoviral segment of the linear vector and the first adenoviral 
sequence. S2 comprises basepairs corresponding to basepairs 1-100 of 
a wildtype adenovirus sequence not to extend past base pair 500 of a 
wildtype adenovirus sequence, and S3 comprises basepairs 
corresponding to basepairs 35835-35935 of a wildtype adenovirus 
sequence . 

An INDEPENDENT CLAIM is also included for preparing, within a 
vector backbone, recombinant adenovirus carrying a desired 
gene, by: 

(a) generating recombinant adenovirus carrying a desired 
gene in complementation cells by transfecting the 

cells with a vector comprising the desired gene flanked by 
adenoviral sequences homologous to a region within an adenoviral 
genome also present within the cells which is targeted for 
homologous recombination, extracting the recombinant adenoviral 
nucleic acid, and carrying out Ml; or 

(b) the method of: 

(i) introducing SI into complementation cells, where 
SI comprises a restriction enzyme site within a region 
targeted for homologous recombination; 

(ii) generating recombinant adenovirus carrying a desired 
gene in the complementation cells, by transfecting 

the cells with a vector comprising the desired gene 
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flanked by adenoviral sequences homologous to the region 
targeted for homologous recombination within the adenoviral 
genome ; 

(iii) digesting resultant cell products with a restriction 
enzyme capable of digesting the restriction enzyme site; 

(iv) extracting the recombinant adenovirus from the 
complementation cells; and 

(v) introducing into the bacterial cell both (I) and the 
recombinant adenovirus, where SI undergoes homologous recombination 
with S2 and S3, resulting in a vector comprising SI and S4 . 

USE - Ml is for incorporating a nucleic acid sequence 
encoding adenovirus into a vector backbone (claimed) . M2 is useful 
for preparing a recombinant adenovirus carrying a desired 
gene within a vector backbone (claimed), preferably a 
circularized adenovirus. The method is also useful for 
isolating and analyzing variants present within a viral preparation, 
including those which are incapable of forming plaques. The 
method is also useful for producing plasmid preparation that 
are subsequently unsed in transient transfection studies to analyze 
transgene expression. The isolated cones may also be used in rescue 
experiments to identify whether or not these genomes are capable of 
forming infectious virus. 

ADVANTAGE - The method produces a plasmid which is 
more readily analyzed by restriction digestion, polymerase chain 
reaction, DNA sequencing, or used in transient 

transfection studies. The adenovirus plasmids that are generated can 
be rescued back into virus form. The entire procedure took four days 
or less instead of the weeks required for plaque purification or 
dilution cloning isolating techniques. The method 

does not require the use of tissue culture materials or facilities. 
The method also allows for a more extensive and thorough 
examination of viral preparation, in that it allows for the 
detection of variants incapable of propagation without the 
assistance of coinfecting intact adenoviral genomes. Under standard, 
conditions of plaque purification the variant genomes are not 
detected. It is predicted that far more variant genomes 
will be observed using the rapid method than would 
otherwise be detected by standard plaque purification method 
. The adenovirus is more readily manipulated and easier to screen. A 
large number of samples can be handled and processed easily. This 
amounts to a significant time and cost advantage. Large amounts of 
each adenoviral genome may be produced by cloning Escherichia coli, 
so that analysis is made by simple ethidium bromide staining after 
gel electrophoresis, rather than the more previous radiolabelling 
techniques used to visualize viral DNA. The 

adenoviral terminal portions utilized on the plasmid encompass much 
shorter regions of the adenoviral ends than previous vectors. This 
relieves the carry constrains of the importing plasmid, and the 
range of mutants capable of isolation through homologous 
recombination is broadened. 
Dwg. 0/4 
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AUTHOR(S) : Jiang, Xiaoyan; Hanna, Zaher; Kaouass, Mohammadi; 
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CORPORATE SOURCE: (1) Laboratory of Molecular Biology, Clinical 

Research Institute of Montreal, 110 Pine Ave., West, 
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SOURCE: Journal of Virology, (September, 2002) Vol. 76, No. 

18, pp. 9046-9059. http://intl-jvi.asm.org/. print. 

ISSN: 0022-538X. 
DOCUMENT TYPE: Article 
LANGUAGE: English 

AB The Ahi-1 locus was initially identified as a common helper provirus 
integration site in Abelson pre-B-cell lymphomas and shown to be 
closely linked to the c-myb proto-oncogene . Since no significant 
alteration of c-myb expression was found in Abelson murine leukemia 
virus-induced pre-B-lymphomas harboring a provirus inserted within 
the Ahi-1 locus, this suggested that it harbors another gene 
whose dysregulation is involved in tumor formation. Here we report 
the identification of a novel gene (Ahi-1) 
targeted by these provirus insertional mutations and the 
cloning of its cDNA. The Ahi-1 proviral insertions were found at the 
3 ! end of the gene, in an inverse transcriptional 

orientation, with most of them located around and downstream of the 
last exon, whereas another insertion was within intron 22. In 
addition, another previously identified provirus insertion site, 
Mis-2, was found to map within the 16th intron of the Ahi-1 
gene. The Ahi-1 cDNA encodes a 1, 047-amino-acid 
protein. The predicted Ahi-1 protein is 

a modular protein that contains one SH3 motif and seven 
WD4 0 repeats. The Ahi-1 gene is conserved in 

mammals and encodes two major RNA species of 5 and 4.2 kb and 
several other shorter splicing variants. The Ahi-1 gene is 
expressed in mouse embryos and in several organs of the mouse and 
rat, notably, at high levels in the brain and testes. In tumor cells 
harboring insertional mutations in Ahi-1, truncated Ahi-l/viral 
fused transcripts were identified, including some splicing variants 
with deletion of the SH3 domain. Therefore, Ahi-1 is a novel 
gene targeted by provirus insertion and encoding a 
protein that exhibits several features of a 

signaling molecule. Thus, Ahi-1 may play an important role in signal 
transduction in normal cells and may be involved in tumor 
development, possibly in cooperation with other oncogenes (such as 
v-abl and c-myc) or with a tumor suppressor gene (Nf 1) , 
since Ahi-1 insertion sites were identified in tumors harboring 
v-abl defective retroviruses or a c-myc transgene or in tumors 
exhibiting deletion of Nfl. 
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NOVELTY - An isolated 33358 polypeptide (a human ankyrin 

family member, referred to as cardiac/skeletal muscle-restricted 

ankyrin-repeat containing protein (C/SKARP) -1 ) , 

(I, ) comprising a fragment which comprises 10 contiguous amino acids 
of a fully defined sequence of 323 amino acids (S2) , or comprising 
60% identity to (S2) , is new. 

DETAILED DESCRIPTION - Where (I): 

(a) comprises a fragment which comprises 10 contiguous amino 
acids of a fully defined sequence of 323 amino acids (S2); 

(b) is a naturally occurring allelic variant of (S2) which is 
encoded by a nucleic acid molecule that hybridizes to a 
complement of a fully defined sequence of 1538 (SI) or 972 

(S3) nucleotides under stringent conditions: 

(c) is a polypeptide which is encoded by a 

nucleic acid molecule comprising a nucleotide sequence which 

is at least 60% identical to a nucleotide sequence of (SI) or (S3) ; 

or 

(d) is a polypeptide comprising an amino acid 
sequence which is 60% identical to (S2) . 

INDEPENDENT CLAIMS are also included for the following: 
(1) an isolated nucleic acid molecule (II); 
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(e- g ., (l non-C f %Sr™ 8 ndin9 ^ <~ proteins 
(iv) bridging of cellular components- 

«- M^WJET ™- < eardiac 

»od ul ate cellar locali^o' £ g f c/SKflRp 

Searcher : shears 308 . 4 „ 4 



09/779240 



(3 L 0d a^ r \e C r1e ia g C "r? v ^ ^P^nesis; 
myogenic cell lineage; and ^ marker) of ^rdiac and/or 

disor^s m ° dUlate 3nd/0r treat C/SKARP- 1 - associated Qr related 

veto" 11 a\L\ e ^ii f ° i r n e Se eS t S h e ng (I) VU 3 "ccbinant expression 
™, or genetic alteration L ^asTa^ 1 ^, 3 ^ 0115 ' to ^tect 
activity. Fragments of (U) a " , „ ? ene and to modulate 33358 
Portions or fragments of (II) are usef ?% probes and Primers. 

Iti^VJZ:'" a " lndiTldUal fr °" * biologic! sa„plo 

clinical testing procedEe? JV S'dJ? tiSSU6 ' as a P a rt of a 
given treatment regimen rv) \,'', determine the efficacy of a 
Proteins, regulating t£e MoavaiLb?! 4 f ° r is ^ a ting 33?58 
modulating 33358 activity ^ )in "nd m 33358 Pr ° tei * s and 

(a) screening assays; ' " d (V) are useful for: 

(b) predictive medicine (en 

assays monitoring clinical tri^' t„ 9 T^ C 3SSays ' Prognostic 

(c) methods of treatment tf pharmac °genetics) ; and 
33358 molecules ^'therapeutic and prophylactic) 

diagnostic target and therape^c ai'^? USSful as a " 
treatment and diagnosis of 33^58 meS J" 3SSayS a PPlicable to 
diseS 19 ? hypertro P^ cardiac Orders a ° n r d " lated disorders such 
disease, (e.g., congestive heart- ff?f and/or cardiovascular 
arteriosclerosis, ischemif rpnf % * lur6 ' cardi °myopathy) , 
tachycardia, bradycardia a f?™ * njury ' ^stenosis 
infarction, coronary arter^^a^*!" 8 ^' myoca rdiai 
syndrome, aneurysm, aortic vTlTlt, 130 ^ 0 disease - raynaud's 
or disorder also includes an eSo?Kv^ S - ? cardi °vascular disease 
molecules are useful for treatS Jh "Vf 1 border, and thus the 
as tumorigenesis, tumor mSSsL n. endothel ial cell disorder such 
endometriosis, Grave's disease ^Ahf™ * S1S ' diab *tic retinopathy 
atherosclerosis) an H „u ■ ' lsche mic disease (e a "F«ny, 

rheumatoid arthritis"?. IhT^sTmoT^r "seaside!,,., 
markers for one or more disorders or d 63 are USeful a * Surrogate 
conditions leading up to disfaJf a ? , dlsea se states or for 
5£ y 3 pharmac °genomic marklrs ^ 33 P ha ™acodyanamic markers 



ACCESSION^NUMBErT 33 200^ d^Ll°° r n TH0MS0N DERWENT 
DOC. NO. CPI: C2nm ,SL 9 , [71] WPI °S 

TITLE • L ^UU1-184 682 

Searcher : qk^-. 

Shears 308-4994 



09/779240 



DERWENT CLASS : 
INVENTOR (S) : 
PATENT ASSIGNEE (S) : 

COUNTRY COUNT: 
PATENT INFORMATION : 



of nucleic acid mixtures at addressable 

locations on substrate and exposing array to probe 

for detecting nucleic acid molecules on 

array. 

B04 D16 

MARINCOLA, F M; MILLER, L D; WANG, E 
(USSH) US DEPT HEALTH & HUMAN SERVICES; (USSH) US 
NAT INST OF HEALTH 
95 



PATENT NO KIND DATE 



WEEK 



LA '. PG 



WO 2001073134 A2 20011004 (200171)* EN 



60 



RW: 


AT 


BE 


CH 


CY 


DE 


DK 


EA 


ES 


FI 


FR 


GB 


GH 


GM 


GR 


IE 


IT 


KE 


LS 


LU 


MC 




MW 


MZ 


NL 


OA 


PT 


SD 


SE 


SL 


SZ 


TR 


TZ 


UG 


ZW 
















W: 


AE 


AG 


AL 


AM 


AT 


AU 


AZ 


BA 


BB 


BG 


BR 


BY 


BZ 


CA 


CH 


CN 


CO 


CR 


cu 


cz 




DE 


DK 


DM 


DZ 


EE 


ES 


FI 


GB 


GD 


GE 


GH 


GM 


HR 


HU 


ID 


IL 


IN 


IS 


JP 


KE 




KG 


KP 


KR 


KZ 


LC 


LK 


LR 


LS 


LT 


LU 


LV 


MA 


MD 


MG 


MK 


MN 


MW 


MX 


MZ 


NO 




NZ 


PL 


PT 


RO 


RU 


SD 


SE 


SG 


SI 


SK 


SL 


TJ 


TM 


TR 


TT 


TZ 


UA 


UG 


US 


UZ 




VN 


YU 


ZA 


zw 



































AU 2001051069 A 20011008 (200208) 



APPLICATION DETAILS: 
PATENT NO KIND 



APPLICATION 



DATE 



WO 2001073134 A2 
AU 2001051069 A 



WO 2001-US9993 
AU 2001-51069 



20010328 
20010328 



FILING DETAILS: 



PATENT NO 



KIND 



PATENT NO 



AU 2001051069 A Based on 



WO 200173134 



PRIORITY APPLN 
AN 



INFO: US 2000-192700P 20000328 
2001-616539 [71] WPIDS 
AB WO 200173134 A UPAB: 20011203 

NOVELTY - A gene expression assay (Ml) involves providing 
an array (A) of nucleic acid mixtures at addressable 
locations on a substrate, where the mixtures comprise 
polynucleotides (P) in quantities substantially proportional to 
quantities of polynucleotides in a specimen from which P are 
obtained, and exposing A to a probe for detecting P on A under 
conditions sufficient to produce binding of probe to P. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for 
the following: 

(1) an assay (M2) to determine relative expression of a 
DNA sequence in a number of biological specimens, involves 
providing a labeled probe, contacting the labeled probe with an 
array of mixtures of nucleic acid molecules arrayed on a 
surface of a solid support, under conditions sufficient to produce 
binding, where each mixture of nucleic acid molecules 
proportionately reflects expression levels of RNA molecules from a 
specimen from which the nucleic acid molecules are 
obtained, separating unbound labeled probe from the array, and 
detecting probe binding on the array; 
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(2) a gene profiling array (I) comprises a number of 
mixtures of nucleic acid molecules immobilized on a solid 
support in an addressable pattern, where each mixture 
proportionately reflects expression levels of mRNA molecules in a 
specimen from which the nucleic acid molecules are 

obtained; 

(3) a kit (II) for determining relative expression of a 
DNA sequence in a number of biological specimens, comprises 
(I), and instructions for using (I); 

(4) an assay (M3) for analyzing a number of gene 
expression profiles, comprises providing (I), exposing (I) to a 
first probe that may hybridize to the nucleic acid 
molecules of the array to identify those nucleic acid 
molecules to which the first probe hybridizes, detecting a first 
hybridization pattern of the first probe, and repeating 

the exposing and detecting steps with a second probe to identify 
samples to which the second probe hybridizes; 

(5) producing (M4) a mixture of mRNA-derived nucleic 

acid molecules, involves isolating an RNA sample from a specimen, 
obtaining one or more RNA templates from a portion of the RNA 
sample, hybridizing the templates with a first primer to form a 
primed template, where the first primer comprises an antisense 
sequence of an RNA polymerase promoter, synthesizing first strand 
cDNA from the primed template, hybridizing the first strand cDNA 
with a second primer to form a switched template, where the second 
primer has a 5' end and a 3' end and comprises a string of dG 
residues at the 3 1 end, synthesizing second strand cDNA from the 
switched template to generate full-length double stranded cDNA, 
transcribing antisense (aRNA) from the full-length double stranded 
cDNA, and reverse transcribing amplified cDNA from the transcribed 
aRNA; and 

(6) an array (III) used in Ml. 

USE - M2 is useful to determine relative expression of a 
DNA sequence in a number of biological specimens (claimed) . 
Ml or (I) is useful for analyzing gene expression, 
particularly for comparing gene expression in a number of 
cells or tissues simultaneously. Ml is useful to examine progression 
of gene expression changes both in the same and in 
different tumor types or in diseases other than neoplasia. (I) is 
useful to evaluate genetic drift, population differences, 
progressive speciation, and other such evolution-related phenomena, 
and to track and study genetically-linked diseases such as cancer, 
polycystic kidney disease, Huntington's disease and hemophilia. (I) 
is useful to identify and analyze prognostic markers or 
markers that predict therapy outcome for various 

diseases or abnormal conditions, such as cancer. (I) is useful for 
high throughput analysis of differential gene expression 
in a specimen (such as a tumor) or a variety of specimens (such as a 
variety of tumor), and for automated preparation and analysis. 

ADVANTAGE - M4 produces high-fidelity, amplified mixtures of 
nucleic acid molecules using a combination of aRNA 
amplification and template-switching synthesis. (I) allows high 
throughput analysis of differential gene expression in a 
specimen (such as a tumor) or a variety of specimen (such as a 
variety of tumor) , and are suitable for automated preparation and 
analysis. Thousands of different kinds of cells and tissues can be 
analyzed for gene expression simultaneously. In addition, 
multiple genes are simultaneously profiled using probes 
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AB Monoclonal antibodies have numerous potential applications as 
therapeutics. As target-specific drugs, they should cause 
fewer side effects than many treatments for cancer and other 
diseases. However, immunogenicity has been a significant barrier to 
clinical success. Several routes of molecular engineering are being 
explored to prevent immune response to MAbs in patients, and the 
best route has yet to be proven. 
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Most process models of how advertising works are linear and posit 
that receivers of advertising messages move through several 
hierarchical stages traditionally described as Attention, Interest 
Desire, and Action (AIDA) ; Awareness, Comprehension, Conviction, 
and Action (DAGMAR); or Knowledge, Liking, Preference, Conviction, 
and Purchase (Learn-Feel-Do) . 
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AB An insertion sequence designated IS1626 was isolated and 

characterized from a Mycobacterium avium clinical strain. IS1626 was 
detected by high-stringency hybridization with the pMB22/S12 probe 
from IS900 of Mycobacterium paratuberculosis . IS1626 is 1418 bp in 
size and has a G+C content of 65 mol%. It has neither terminal 
inverted repeats nor flanking direct repeats. 

Analysis of three IS1626 insertion sites in the M. avium strain the 

corresponding potential insertion • sites in two IS1626-free M. avium 

strains indicated a consensus sequence of CATGCN(4-5) TCCTN(2) G for 

IS1626 insertion. In the three clones examined, IS1626 has the same 

orientation with respect to this target site. IS1626 has 

two major ORFs . ORF1179 encodes a predicted 

protein of 393 amino acids. ORF930, on the 

complementary strand of ORF117 9, encodes a protein 

of 310 amino acids. The Shine-Dalgarno sequence for ORF930 is 

partially located in the flanking region, similar to other 

IS900-related elements. Analysis of the comparable features 

of insertion sequences and their variable occurrence in related 

organisms is useful for studying the evolution of these elements and 

their hosts. 
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AB Introduction 

Acquired immunodeficiency syndrome has attracted much attention in 
the medical field, as well as in the general community. Less than 20 
years old, this syndrome has now become a major cause of morbidity 
and mortality in the United States, particularly among young adults. 
It is characterized by the development of opportunistic infections 
and malignancies resulting from a weakened immune system. 
AIDS was first described in the United States in 1981, when cases of 
Pneumocystis carinii pneumonia and Kaposi's sarcoma were reported 
among young homosexual men in New York and California. In 1983, 
researchers isolated human immunodeficiency virus as the pathogen 
causing AIDS and serologic tests were developed two years later to 
detect antibodies to HIV in the blood. This facilitated detection in 
patients who were clinically asymptomatic and had not yet developed 
AIDS-related opportunistic infections and donated blood could now be 
screened. Shortly thereafter, in 1987, the first antiretroviral 
agent for treating HIV infection was approved by the U.S. Food and 
Drug [Administration. sup. (1) (2) ] . 

Over the past 15 years, important advances have been made in 
understanding this syndrome and its clinical management. Advances in 
technology have made it possible to quantify the number of HIV RNA 
particles in plasma, which may be useful in predicting 
disease progression and in guiding treatment of HIV infection. The 
approach to treating HIV infection has also changed, in that 
combination antiretroviral therapy has been shown superior to 
monotherapy, which was considered the standard of care several years 
ago. 
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Pharmacists are in a unique position to screen for overlapping 
toxicities and drug interactions in therapeutic regimens 
characterized by polypharmacy and the well-informed pharmacist can 
play a key role in the education, treatment, compliance and 
prevention efforts so crucial for the success of HIV/AIDS therapy. 
THIS IS AN EXCERPT: COPYRIGHT 1998 Lebhar- Friedman Inc. 
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*FULL TEXT IS AVAILABLE IN THE ALL FORMAT* 
Mark A. Schenerman and Kristan Phillips 

Comparability refers to the assessment of similarities and 
differences - whether due to modifications of raw materials, 
process, or a manufacturing facility - of a product manufactured by 
two means (1) . FDA recognizes that the development of a biological 
product may require changes in the method of manufacture 
and the location of manufacture before or after biological license 
application (BLA) approval. Comparability is demonstrated through 
equivalence of the safety, identity, purity, and potency of product 
manufactured before and after a modification is made. When 
developing a comparability testing program, you need to consider the 
potential impact of the manufacturing change on the product and the 
ability to assess this impact. Note that from a regulatory 
perspective the term bioequivalence is generally applied to generic 
drug products regulated by the Office of Generic Drugs at the Center 
for Drug Evaluation and Research (CDER) . 

This concept of comparability represents a new regulatory paradigm 
for biological products. For many years biologies were considered 
too complex and our analytical tools too primitive to provide an 
accurate picture of our final product. 
THIS IS AN EXCERPT: COPYRIGHT 1997 Aster Publishing Corporation 
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AB Delayed contact hypersensitivity (DCHS) present two original 

features: the route of administration of the antigen is 
epicutaneous and the presenting cells are essentially epidermal 
Langerhans cells. This disease typically occurs after 
repeated contact of the skin with occupational or 
environmental sensitizing agents. Various cell types, various 
mediators of inflammation and many cytokines are involved in the 
regulation of the inflammatory and immunological processes of DCHS. 
Keratinocytes, the first target of contact allergens, play 
an active role in this process by the production of cytokines, but 
the pivot cell is the Langerhans cell (LC) , a dendritic cell of the 
epidermis, which is able to transmit the antigenic information of 
allergens and induced a specific proliferative lymphocyte response 
to the antigen. DCHS reactions occur in two phases: a first 
so-called sensitisation phase during which the hapten binds to the 
LC which then migrates to the proximal lymph nodes, where it 
transmits the information to T lymphocytes. This lymphocyte 
activation results in T cell clonal expansion. These cells then 
return to the circulation. They correspond to memory T cells. The 
so-called revelation step of DCHS is involved during a subsequent 
contact of the skin with the same allergen. LC perform the same 
process. Circulating memory T cells and memory T cells present in 
the dermis are then restimulated by LC and this activation induces 
cytokine production, responsible for the inflammatory reaction 
characteristic of DCHS, in which lymphocytes, macrophages 
and eosinophil polymorphonuclear cells are involved. The early 
events are still poorly elucidated (penetration of the hapten, 
binding with epidermal proteins, capture by LC) , as is 
processing of the antigen by LC which results in presentation to T 
cells in the lymph node of an immunogenic substance recognized by a 
specific receptor. On the other hand, the migration of LC and their 
primary stimulation function has been more clearly defined. 
Adherence molecules are involved and, during LC-T cell cooperation, 
accessory molecules, including those belonging to the B7 family, 
play an essential role. Homing, preferential guidance of specific T 
lymphocytes towards the site of penetration of the hapten is 
starting to be more clearly understood due to the discovery of 
specific molecules er;pressed by endothelial cells (E-selectins, 
ICAM, VCAM) and T lymphocytes (CLA) . Various proinflammatory or 
antiinflammatory cytokines (IL1, IL6, IL12) produced in the skin 
participate In regulation of the inflammatory process. Ultraviolet 
rays (UV) also have a modulatory role on the production of these 
cytokines and the cell functions involved. Finally, the causes 
lending to either an inflammatory reaction or a tolerance situation, 
i.e. lymphocyte clonal anergy, are still poorly elucidated. Current 
hypotheses involve cytokines (IL12), lymphocyte subpopulations (THO, 
TH1, TH2) and a possible antigen-presenting function of 
keratinocytes to explain this active situation of non-response or 
restoration of normality. In vitro reproduction of delayed contact 
hypersensitivity (DCHS) has two objectives: 1) a better 
understanding of the: pathophysiological mechanisms, 2) development 
of laboratory tests allowing prediction of the allergenic 
property of substances applied to human skin. 

For this purpose, predictive methods have 
been based, up until recently, on the production of DCHS in animals 
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procedures . 

Traditionally, the pyramidal approach to therapy of RA has been 
useful. Initially, treatment consists of education, rest, exercise, 
and patient counseling, combined with basic anti-inflammatory 
therapy with salicylates or other NSAIDs. Additional therapies, or 
second-line agents, may be added to this foundation. 
Because of the potential destructive impact of RA, therapy has 
become more aggressive. The exact nature and timing of the added 
therapies, however, depend on the experience of the physician and 
the response of the patient. 
References are available upon request. 

Ahmed H. Hikal is Associate Professor, Pharmaceutics Department, and 
Research Associate Professor, Research Institute of Pharmaceutical 
Sciences, at the School of Pharmacy, The University of Mississippi 
Ethel M. Hikal is a Relief Pharmacist, South Panola Community 
Hospital, Batesville, Miss., Tallahatchie General Hospital, 
Charleston, Miss., and at Wal-Mart Pharmacy, Oxford, Mississippi 
NONSTEROIDAL ANT I -INFLAMMATORY DRUGS (NSAIDs) 
# 



NSAID 
Aspirin 

Diclofenac sodium 
Dif lunisal 
Etodolac 

Fenoprofen calcium 
Flurbiprofen 
Ibuprofen 
Indomethacin 
Ketoprof en 

Meclof enamate sodium 

Nabumetone 

Naproxen 

Oxaprozin 

Piroxicam 

Sulindac 

Tolmetin sodium 



Half-life Daily dosage 

(hr) (mg) 

4-15 1000-6000 in 4 doses 

1-2 100-200 in 2 doses 

8-12 500-1000 in 2 doses 

7 600-1200 in 1-4 doses 

2.5 1800-3200 in 3-4 doses 

3-4 200-300 in 2-3 doses 

1- 3 1200-3200 in 3-4 doses 

2- 5 50-200 in 2-4 doses 
2-4 150-300 in 3-4 doses 

2 200-400 in 3-4 doses 

22.5-30 500-2000 in 1-2 doses 

12-15 500-1500 in 1-3 doses 

42-50 1200-1800 single dose 

38-45 30 in a single dose 

18 150-300 in 2 doses 

1 1200-2400 in 3 doses 
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determining calculated binding constants of the corresponding 
preselected DNA sequence by means of the equation: 

In (kII/kI)=(kappa/RT) (DeltaGDoI-DeltaGDoII ) , 

wherein kl and kll are composite reaction rate constants for 
amplification using potential DNA primers I and II, 
respectively, KI and KII are binding constants for an amplifying 
reagent binding to the preselected or native DNA sequence 
to which potential DNA primers I and II anneal, 
respectively, R is the universal gas constant, T is absolute 
temperature, kappa is a proportionality constant or function. 

wherein kappa is predetermined in accord with the equations for 
the amplifying reagent and for set sequence length by relating 
measured composite rates of amplification or binding with potential 
DNA primers or set of potential DNA primers I and 
II to their respective differences in free energies of melting 
DeltaGDol and DeltaGDoII, which comprises: 

calculating free energies of melting DeltaGDo for at least two 
potential DNA primers if kappa is a proportionality 

constant, or at least three if a function, determined in accord with 
the summing step; 

measuring composite rates of amplification or generation of 
amplification products or binding constants for synthetic or native 
DNA sequences to which the potential DNA primers 
anneal; and 

relating the measured composite rates of reaction or binding 
constants to their respective differences in free energy of melting 
DeltaGDo in accord with the equations; and 

synthesising the potential DNA primers by chemical or 
biochemical methods; 

measuring composite reaction rates of amplification or 
generation of amplification products using the potential DNA 
primers by any method suited for the purpose; 

repeating the iterating, calculating, synthesising 
and measuring steps to determine at least one additional potential 
DNA primer which amplifies, or generates amplification 
products for, the preselected or native DNA sequence, does 
not interact with any other primer, and has a composite reaction 
rate approximately equal to that of the first potential DNA 
primer or set of potential DNA primers; and 

choosing at least one set of potential DNA primers 
with approximately equal calculated composite reaction rates for 
amplifying, or generating amplification products for, at least two 
different regions from the corresponding preselected or native 
DNA sequence using each set of potential DNA 

primers, wherein calculated composite reaction rates fall within a 
predefined deviation about a mean composite reaction rate; and using 
at least one set of potential DNA primers to detect the 
presence or absence of a preselected or native DNA 
sequence, which comprises: 

combining aliquots of at least one set of DNA primers 
with an analytical unknown sample which may or may not contain a 
preselected or native DNA sequence; 

performing the amplifying reaction to generate amplified 
concentrations or amplification products of the preselected or 
native DNA sequence, if present; and 

observing by any suitable qualitative or quantitative 
method the presence or absence of, the preselected or native 
DNA sequence, thereby detecting the presence or absence of 
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IS1001 from Bordetella paJapfrtusi^ f^Sr^ ^senteroides and 
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researchers tried to optimise the data obtained for the widely 
differing colour measurement requirements of different industries 
but for many, the essentially laboratory-based instrumentation could 
not provide the necessary data quickly enough to control colour in 
real time. 

For batch production where processing can be held until colour 
results are obtained this method of control is adequate, 
but for the true dynamic process, it is less than satisfactory. 
What is required is instrumentation capable of measuring directly in 
the process, with minimum disruption to the process. Data should be 
produced instantaneously with no operator intervention and the 
instrumentation should be capable of withstanding the harsh process 
environment. Such systems indeed exist, but for many the capital 
outlay required to acquire and implement the technology has been 
unacceptably high. Until now. 

THIS IS AN EXCERPT: Copyright 1993 FMJ International Publications 
Limited 

(FILE 'HCAPLUS* ENTERED AT 11:24:32 ON 16 APR 2003) 
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TITLE: Identification of co-regulated genes through 

Bayesian clustering of predicted 

regulatory binding sites 
AUTHOR (S) : Qin, Zhaohui S.; McCue, Lee Ann; Thompson, 

William; Mayerhofer, Linda; Lawrence, Charles 

E. ; Liu, Jun S. 

CORPORATE SOURCE: Department of Statistics, Harvard University, 

Cambridge, MA, 02138, USA 
SOURCE: Nature Biotechnology (2003), 21(4), 435-439 

CODEN: NABIF9; ISSN: 1087-0156 
PUBLISHER: Nature Publishing Group 

DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB The identification of co-regulated genes and their 

transcription-factor binding sites (TFBS) are key steps toward 
understanding transcription regulation. In addn. to effective lab. 
assays, various computational approaches for the detection of TFBS 
in promoter regions of coexpressed genes have been developed. The 
availability of complete genome sequences combined with the 
likelihood .that transcription factors and their cognate sites are 
often conserved during evolution has led to the development of 
phylogenetic f ootprinting . The modus operandi of this technique is 
to search for conserved motifs upstream of orthologous genes from 
closely related species. The method can identify hundreds of TFBS 
without prior knowledge of co-regulation or coexpression . Because 
many of these predicted sites are likely to be bound by 
the same transcription factor, motifs with similar patterns can be 
put into clusters so as to infer the sets of co-regulated genes, 
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site of tumor origin in the context of multiple diverse tumor 
classes . 
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Primer on medical genomics: Part III: 
Microarray experiments and data analysis 
Tefferi, Ayalew; Bolander, Mark E . ; Ansell, 
Stephen M. ; Wieben, Eric D.; Spelsberg, Thomas 
c. 

Division of Hematology and Internal Medicine, 
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Dowden Health Media, Inc.. 
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English 

A review. Genomics has been defined as the comprehensive study of 
whole sets of genes, gene products, and their interactions as 
opposed to the study of single genes or proteins. 
Microarray technol. is one of many novel tools that are 
allowing global and high-throughput anal, of genes and gene 
products. In addn. to an introduction on underlying principles, the 
current review focuses on the use of both complementary 
DNA and oligodeoxynucleotide microarrays in gene 

expression anal. Genome-wide expts. generate a massive amt. of data 
points that require systematic methods of anal, to ext. biol. useful 
information. Accordingly, the current educational communication 
discusses different methods of data anal., including supervised and 
unsupervised clustering algorithms. Illustrative clin. 
examples show clin. applications, including (1) identification of 
candidate genes or pathol. pathways (ie, elucidation of 
pathogenesis); (2) identification of "new" mol. classes of diseases 
that may be relevant in disease reclassification, prognostication, 
and treatment selection (ie, class discovery); and (3) use of 
expression profiles of known disease classes to predict 
diagnosis and classification of unknown samples (ie, class 
prediction) . The current review should serve as an 
introduction to the subject for clinician investigators, physicians 
and medical scientists in training, practicing clinicians, and other 
students of medicine. 
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Only a small fraction of short oligonucleotide probes bind 
efficiently to complementary segments in long RNA 

transcripts. Technologies such as array-based transcript profiling 
and antisense control of gene expression would benefit greatly from 
a method for predicting probes that bind well to a given 
target RNA. To develop an algorithm for prioritizing 
selection of probes, we have analyzed predicted 

thermodynamic parameters for the binding of several large sets of 
probes to complementary RNA transcripts. The binding of 
five of these sets of probes to their RNA targets has been reported 
by others. In addition, we have used a method for light-directed 
synthesis of oligonucleotide arrays that we developed to generate 
two new arrays of surface-bound probes and measured the binding of 
these probes to their RNA targets. We considered predicted 
free energies for intramolecular base pairing of the oligonucleotide 
and its RNA target as well as the predicted free energy of 
intermolecular hybridization of probe and target. We find that a 
reliable predictor of probes that will hybridize 
significantly with their targeted transcripts is the 
predicted free energy of hybridization minus the 
predicted free energy for intramolecular folding of the 
probe . 
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Identification of co-regulated genes through Bayesian 
clustering of predicted regulatory binding 
sites . 

Qin Zhaohui S; McCue Lee Ann; Thompson William; 
Mayerhofer Linda; Lawrence Charles E; Liu Jun S 
Department of Statistics, Harvard University, 
Cambridge, MA 02138. 
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development of phylogenetic f ootprinting . The modus operandi of 
this technique is to search for conserved motifs upstream of 
orthologous genes from closely related species. The method can 
identify hundreds of TFBS without prior knowledge of co-regulation 
or coexpression. Because many of these predicted sites 
are likely to be bound by the same transcription factor, motifs with 
similar patterns can be put into clusters so as to infer the sets of 
co-regulated genes, that is, the regulons . This strategy utilizes 
only genome sequence information and is complementary to 
and confirmative of gene expression data generated by 
microarray experiments. However, the limited data available- 
to characterize individual binding patterns, the variation in motif 
alignment, motif width, and base conservation, and the lack of 
knowledge of the number and sizes of regulons make this inference 
problem difficult. We have developed a Gibbs sampling-based 
Bayesian motif clustering (BMC) algorithm to address these 
challenges. Tests on simulated data sets show that BMC produces 
many fewer errors than hierarchical and K-means clustering methods. 
The application of BMC to hundreds of predicted 
gamma-proteobacterial motifs correctly identified many 
experimentally reported regulons, inferred the existence of 
previously unreported members of these regulons, and suggested novel 
regulons . 

L23 ANSWER 3 OF 23 BIOSIS COPYRIGHT 2003 BIOLOGICAL ABSTRACTS INC. 
ACCESSION NUMBER: 2003:149277 BIOSIS 
DOCUMENT NUMBER: PREV20030014 9277 

TITLE: Identification of potential serum markers for tumors 

with lymph node metastasis based on gene expression 
microarray . 

AUTHOR (S ) : Roh, Mee-Sook (1); Kwon, Hyuk-Chan; Jeong, Jin-Sok 

(1); Kim, Jae-Seok; Rha, Seo-Hee (1); Lee, Hye-Jeong; 
Jeong, Jin-Sook (1); Lee, Ki-Nam; Choi, Phil-Jo; Son, 
Choon-Hee; Yoon, Jin-Han; Shin, Sang-Hun; Kim, 
Hyo-Jin; Choi, Seok-Ryeol; Lee, Jong-Hoon; Song, 
Gi-Hoon; Hwang, Tae-Ho 

CORPORATE SOURCE: (1) Department of Pathology, Medical College, Dong-A 

University, Busan, 602-714, South Korea South Korea 

SOURCE: Cancer Gene Therapy, (January 2003, 2003) Vol. 10, 

No. Supplement 1, pp. S40-S41. print. 
Meeting Info. : Eleventh International Conference on 
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NOVELTY - Analyzing a nucleic acid sequence by constructing a 

context functional descriptor (CFD) , is new. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for 

the following: 

(1) a method of identifying a CFD component associated with a 
property of a nucleic acid sequence or a peptide encoded by the 
nucleic acid; 

(2) a method of comparing nucleic acid sequences; 

(3) a method of providing or identifying a population of 
nucleic acid sequences; 

(4) a method of analyzing a nucleic acid sequence to determine 
the Tm involved with introducing a change; 

(5) a computer-readable file, having a record which includes an 
element that identifies a nucleic acid, and an element which 
described the CFD, or one or more of its components; and 

(6) a set of nucleic acids. 

USE - The method is useful for microarray 
applications involving nucleic acid hybridization. 

ADVANTAGE - The method obviates the need to consider or 
evaluate, explicitly, order dependent sequence specific interactions 
(e.g., singlet, nearest-neighbors, and next-nearest-neighbors 
interactions) . Further, it has enhanced predictive power 
over existing analytical methods. 
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Determining the likelihoood for aneuploidy in a 

cell type or organism, for example, a deletion or 

duplication of part of a chromosome or of 10 

adjacent genes, comprises using expression 

profiles . 

B04 D16 S03 T01 

HUGHES, T R; MARTON, M J 

(ROSE-N) ROSETTA INPHARMATICS INC 
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PATENT NO 



KIND DATE 



WEEK 
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WO 2002044411 Al 20020606 (200257)* EN 137 
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APPLICATION DETAILS: 



PATENT NO 



KIND 



APPLICATION 



DATE 



WO 2002044411 Al 



WO 2000-US35352 20001222 



PRIORITY APPLN. INFO: US 2000-250597P 20001201 

AN 2002-537459 [57] WPIDS 

AB WO 200244411 A UPAB: 20020906 

NOVELTY - Determination of the likelihood of aneuploidy in a cell 
type or organism (A) by the use of expression profiles. 

DETAILED DESCRIPTION - Determination of the likelihood of 
aneuploidy in a cell type or organism (A) by: 

(a) quantifying levels of cellular constituents associated with 
genes (at least 1000, preferably 50000) mapped to different 
chromosomes in the genome in a cell of (A) ; 

(b) comparing the quantified levels of cellular constituents 
associated with the genes mapped to the same chromosome, to the mean 
quantified levels of the cellular constituents associated with the 
genes; and 

(c) identifying the genes mapped to the same chromosome for 
which the quantified levels of cellular constituents associated with 
the genes are the same for each of the genes and are dissimilar to 
the mean quantified levels of the cellular constituents associated 
with the genes. 

The step (c) indicates that aneuploidy of the same chromosome 
or its portion- is likely to be present in (A) . 
INDEPENDENT CLAIMS are included for: 

(1) detecting the predisposition of (A) to a disease associated 
with aneuploidy; 

(2) determining the likelihood of aneuploidy in (A) by 
detecting an expression bias shared by genes mapped to a single 
chromosome or mapped to a chromosomal portion in the cell of (A) ; 

(3) detecting the presence of aneuploidy in (A) by comparing 
(al) a profile or a predicted profile derived from the 

first profile to a database comprising several landmark profiles to 
determine at a landmark profile most similar to the first or 
predicted profile; 

(4) diagnosing a disease associated with aneuploidy in (A) 
using (al) ; 
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(5) determining the likelihood of aneuploidy of a gene in (A) 
by identifying (bl) a cellular constituent that is a member of a 
wild-type co-varying cellular constituent set; 

(6) correcting a profile of a cell type or organism for 
aneuploidy of a chromosome or chromosomal segment by determining 
(cl) the value (ml) of the mean chromosomal offset ratio for several 
genes mapped to the chromosome or chromosomal segment in (A) and 
dividing (c2) the mean quantified level of several cellular 
constituents associated with several genes mapped to the chromosome 
or chromosomal segment by the value of the mean chromosomal offset 
ratio; 

(7) correcting a profile of a cell type or organism for 
aneuploidy by determining (dl) the value (m2) of the mean 
chromosomal offset ratio for several genes associated with several 
constituents whose mean quantified level is altered by the presence 
of a gene in (A) having an abnormal copy number, and dividing (d2) 
the mean quantified level of the several cellular constituents that 
are altered by the presence of a gene having an abnormal copy number 
by the value of the mean offset ratio or the step (d2) for several 
genes; 

(8) a computer system for determining the likelihood of 
aneuploidy in (A) , for detecting the predisposition of (A) to a 
disease associated with aneuploidy and for diagnosing the disease 
associated with the known aneuploidy in (A) comprising a processor 
unit and a memory unit; 

(9) a computer program product for directing a user computer in 
a computer-aided determination of the likelihood of aneuploidy in 
(A) comprising either a computer code for carrying out the step (b) 
or (c); 

(10) a computer system for correcting a profile of (A) for 
aneuploidy of a chromosome or chromosomal segment comprising 
processor units and the memory units, where the memory units contain 
a program, which cause the processor units to execute either the 
steps (cl) and (c2) or the steps (dl) and (d2), or the step (c2) for 
several genes, or the step (d2) for several genes; 

(11) a kit for detecting the presence of aneuploidy comprising 
a positionally-addressable array of polynucleotide probes bound to a 
support and either expression profiles, in electronic or written 
form, each correlated to a known alteration in copy number of a 
gene, or a container comprising RNA, or cDNA, of a cell having the 
known aneuploidy. 

USE - The method is used for determining the likelihood of 
aneuploidy (e.g. a deletion or duplication of part of a chromosome 
or of 10 adjacent genes) in (A) e.g. a human, for detecting the 
predisposition of (A) to a disease associated with aneuploidy 
resulting from: 

(i) amplification deletion translocation of a gene, where the 
disease is cancer (e.g. breast cancer, colon cancer, acute 
myelogenous leukemia, chronic myelocytic leukemia, acute 
promyelocytic leukemia, acute nonlymphocytic leukemia, acute 
monocytic leukemia, acute myelomonocytic leukemia, Burkitt's 
lymphoma, non-Hodgkin ' s lymphoma, acute lymphoblastic leukemia, 
chronic lymphocyticleukemia, myeloproliferative diseases, small cell 
lung cancer, kidney cancer, uterin cancer, cervical cancer, prostate 
cancer, bladder cancer, ovarian cancer, liposarcoma, synovial 
sarcoma, rhabdomyosarcoma, extraskeletal myxoid chondrosarcoma, 
Ewing's tumor, peripheral neuroepithelioma, testicular and ovarian 
dysgerminoma, retinoblastoma, Wilms 1 tumor, neuroblastoma, malignant 
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melanoma, and mesothelioma); 
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COUNTRY: United States 

DOCUMENT TYPE: Journal; Article 

FILE SEGMENT : 005 General Pathology and Pathological Anatomy 

LANGUAGE: English 
SUMMARY LANGUAGE: English 

AB Squamous cell carcinomas of the head and neck constitute an 
anatomically heterogeneous group of neoDlasms that ZhZll * 

Here we report the use of oDNS microariay, contirnina »2r6 
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ENTRY DATE: 



AB 



Division of Hematology and Internal Medicine, Mayo 
Clinic, Rochester, ' Minn 55905, USA. 
MAYO CLINIC PROCEEDINGS, (2002 Sep) 77 (9) 927-40. 
Ref: 53 

Journal code: 0405543. ISSN: 0025-6196. 
United States 

Journal; Article; (JOURNAL ARTICLE) 
General Review; (REVIEW) 
(REVIEW, TUTORIAL) 
English 

Abridged Index Medicus Journals; Priority Journals; 
Space Life Sciences 
200210 

Entered STN: 20020918 
Last Updated on STN: 20021029 
Entered Medline: 20021021 
Genomics has been defined as the comprehensive study of whole sets 
of genes, gene products, and their interactions as opposed to the 
study of single genes or proteins. Microarray technology 
is one of many novel tools that are allowing global and 
high-throughput analysis of genes and gene products. In addition to 
an introduction on underlying principles, the current review focuses 
on the use of both complementary DNA and 
oligodeoxynucleotide microarrays in gene expression 

analysis. Genome-wide experiments generate a massive amount of data 
points that require systematic methods of analysis to extract 
biologically useful information. Accordingly, the current 
educational communication discusses different methods of data 
analysis, including supervised and unsupervised clustering 
algorithms. Illustrative clinical examples show clinical 
applications, including (1) identification of candidate genes or 
pathological pathways (ie, elucidation of pathogenesis); (2) 
identification of "new" molecular classes of diseases that may be 
relevant in disease reclassification, prognostication, and treatment 
selection (ie, class discovery); and (3) use of expression profiles 
of known disease classes to predict diagnosis and 
classification of unknown samples (ie, class prediction) . 
The current review should serve as an introduction to the subject 
for clinician investigators, physicians and medical scientists in 
training, practicing clinicians, and other students of medicine. 
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An algorithm for finding protein-DNA 
binding sites with applications to 
chromatin-immunoprecipitation microarray 
experiments . 
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correlation with an archived profile could lead to some knowledge 
regarding the potential effects of the tested compound or exposure. 
We have previously shown that cDNA microarrays can be used 
to generate chemical-specific gene expression profiles that can be 
distinguished across and within compound classes, using clustering, 
simple correlation, or principal component analyses. In this 
report, we test the hypothesis that knowledge can be gained 
regarding the nature of blinded samples, using an initial training 
set comprised of gene expression profiles derived from rat liver 
exposed to clofibrate, Wyeth 14,643, gemfibrozil, or phenobarbital 
for 24 h or 2 weeks of exposure. Highly discriminant genes were 
derived from our database training set using approaches including 
linear discriminant analysis (LDA) and genetic algorithm 
/K-nearest neighbors (GA/KNN) . Using these genes in the analysis of 
coded liver RNA samples derived from 24-h, 3-day, or 2-week 
exposures to phenytoin, diethylhexylpthalate, or hexobarbital led to 
successful prediction of whether these samples were 
derived from livers of rats exposed to enzyme inducers or to 
peroxisome prolif erators . This validates our initial hypothesis and 
lends credibility to the concept that the further development of a 
gene expression database for chemical effects will greatly enhance 
the hazard identification processes. 
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Program, Boston University, Boston, MA 02215, United 
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SOURCE: Trends in Biotechnology, (1 Jan 2002) 20/1 (29-35). 

Refs: 104 

ISSN: 0167-7799 CODEN: T RBI DM 
PUBLISHER I DENT. : S 0167-7799(01)0184 6-7 
COUNTRY: United Kingdom 

DOCUMENT TYPE: Journal; General Review 

FILE SEGMENT: 004 Microbiology 

022 Human Genetics 

LANGUAGE: English 
SUMMARY LANGUAGE: English 

AB Recent advances in massively parallel experimental and computational 
technologies are leading to radically new approaches to the early 
phases of the drug production pipeline. The revolution in DNA 
microarray technologies and the imminent emergence of its 
analogue for proteins, along with machine learning 
algorithms, promise rapid acceleration in the identification 
of potential drug targets, and in high-throughput screens for 
subpopulation-specif ic toxicity. Similarly, advances in structural 
genomics in conjunction with in vitro and in silico evolutionary 
methods will rapidly accelerate the number of lead drug candidates 
and substantially augment their target specificity. Taken 
collectively, these advances will usher in an era of 
predictive medicine, which will move medical practice from 
reactive therapy after disease onset, to proactive prevention. 
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Entered Medline: 20010322 
AB The most important product of the sequencing of a genome is a 

complete, accurate catalogue of genes and their products, primarily 
messenger RNA transcripts and their cognate proteins. Such a 
catalogue cannot be constructed by computational annotation alone; 
it requires experimental validation on a genome scale. Using 'exon' 
and 'tiling' arrays fabricated by ink-jet oligonucleotide synthesis, 
we devised an experimental approach to validate and refine 
computational gene predictions and define full-length 
transcripts on the basis of co-regulated expression of their exons . 
These methods can provide more accurate gene numbers and allow the 
detection of mRNA splice variants and identification of the tissue- 
and disease-specific conditions under which genes are expressed. We 
apply our technique to chromosome 22q under 69 experimental 
condition pairs, and to the entire human genome under two 
experimental conditions. We discuss implications for more 
comprehensive, consistent and reliable genome annotation, more 
efficient, full-length complementary DNA cloning 
strategies and application to complex diseases. 
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LANGUAGE: English 

AB Despite high-dose chemotherapy, bisphosphonates and thalidomide, 

multiple myeloma (MM) remains an incurable malignancy. Possible ways 
to improve survival, and ideally to achieve cure, are to identify 
patients at an earlier stage of the disease, and/or to identify new 
therapeutic targets enabling the development of novel therapeutic 
strategies. Gene-expression profiling using complementary 
DNA (cDNA) microarrays allowed analysis of thousand 
markers and has been successfully used to identify unrecognized 
subgroups of a variety of cancers including breast tumors, melanoma 
and diffuse large B-cell lymphoma. In this study we examined gene 
expression pattern in 53 patients with MM, 4 with plasma cell 
leukemia (PCL), 9 with smoldering myeloma (SMM), 6 with monoclonal 
gammopathy of undetermined significance (MGUS) and in 3 normal 
plasma-cell specimens. To help for the interpretation of the 
variation in the tumor samples we also established gene profiling of 
2 human myeloma cell lines (HMCL) cultured in five different 
conditions. Overall, we analyzed 84 samples of either normal or 
malignant highly purified CD138+ plasma cells using nylon DNA 
microarray consisting of 7200 human genes. Hierarchical 
clustering algorithm has been applied to identify common 
patterns of gene expression. Genes were grouped on the basis of 
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AB Breast cancer is characterized by an important histoclinical 

heterogeneity that currently hampers the selection of the most 
appropriate treatment for each case. This problem could be solved by 
the identification of new parameters that better predict 
the natural history of the disease and its sensitivity to treatment. 
A large-scale molecular characterization of breast cancer could help 
in this context. Using cDNA arrays, we studied the quantitative mRNA 
expression levels of 176 candidate genes in 34 primary breast 
carcinomas along three directions: comparison of tumor samples, 
correlations of molecular data with conventional histoclinical 
prognostic features and gene correlations. The study evidenced 
extensive heterogeneity of breast tumors at the transcriptional 
level. A hierarchical clustering algorithm identified two 
molecularly distinct subgroups of tumors characterized by a 
different clinical outcome after chemotherapy. This outcome could 
not have been predicted by the commonly used histoclinical 
parameters. No correlation was found with the age of patients, tumor 
size, histological type and grade. However, expression of genes was 
differential in tumors with lymph node metastasis and according to 
the estrogen receptor status; ERBB2 expression was strongly 
correlated with the lymph node status (P < 0.0001) and that of GATA3 
with the presence of estrogen receptors (P < 0.001). Thus, our 
results identified new ways to group tumors according to outcome and 
new potential targets of carcinogenesis. They show that the 
systematic use of cDNA array testing holds great promise to improve 
the classification of breast cancer in terms of prognosis and 
chemosensitivity and to provide new potential therapeutic targets. 
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AB We have identified human prostate cancer- and tissue-specific genes 
using cDNA library subtraction in conjunction with high throughput 
microarray screening. Subtracted cDNA libraries of prostate 
tumors and normal prostate tissue were generated. Characterization 
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of subtracted libraries showed enrichment of both cancer- and 
tissue-specific genes. Highly redundant clones were eliminated by 
colony hybridization. The remaining clones were selected for 
microarray to determine gene expression levels in a variety 
of tumor and normal tissues. Clones showing overexpression in 
prostate tumors and/or normal prostate tissues were selected and 
sequenced. Here we report the identification of two genes, P503S 
and P504S, from subtracted libraries and a third gene, P510S, by 
subtraction followed by microarray screening. Their 
expression profiles were further confirmed by Northern blot, 
real-time PCR (TaqMan) , and immunohistochemistry to be overexpressed 
in prostate tissues and/or prostate tumors. Full-length cDNA 
sequences were cloned, and their subcellular locations were 
predicted by a bioinformatic algorithm, PSORT, to 
be plasma membrane proteins. The genes identified through these 
approaches are potential candidates for cancer diagnosis and 
therapy. 

L23 ANSWER 22 OF 23 SCISEARCH COPYRIGHT 2003 ISI (R) 
ACCESSION NUMBER: 2001:111676 SCISEARCH 
THE GENUINE ARTICLE: 394 ZG 

TITLE: Toxicogenomics-based discrimination of toxic 

mechanism in HepG2 human hepatoma cells 

AUTHOR: Burczynski M E; McMillian M; Ciervo J; Li L; Parker 

J B; Dunn R T; Hicken S; Farr S; Johnson M D 
(Reprint) 

CORPORATE SOURCE: Robert Wood Johnson Pharmaceut Res Inst, Drug Safety 

Evaluat, POB 300, Route 202, Raritan, NJ 08869 USA 
(Reprint); Robert Wood Johnson Pharmaceut Res Inst, 
Drug Safety Evaluat, Raritan, NJ 08869 USA; Phase 1 
Mol Toxicol Inc, Santa Fe, NM 87505 USA 

COUNTRY OF AUTHOR: USA 

SOURCE: TOXICOLOGICAL SCIENCES, (DEC 2000) Vol. 58, No. 2, 

pp. 399-415. 

Publisher: OXFORD UNIV PRESS, GREAT CLARENDON ST, 
OXFORD 0X2 6DP, ENGLAND. 
ISSN: 1096-6080. 

DOCUMENT TYPE: Article; Journal 

LANGUAGE: English 

REFERENCE COUNT: 38 

^ABSTRACT IS AVAILABLE IN THE ALL AND I ALL FORMATS* 

AB The rapid discovery of sequence information from the Human Genome 

Project has exponentially increased the amount of data that can be 
retrieved from biomedical experiments. Gene expression profiling, 
through the use of microarray technology, is rapidly 
contributing to an improved understanding of global, coordinated 
cellular events in a variety of paradigms. In the field of 
toxicology, the potential application of toxicogenomics to indicate 
the toxicity of unknown compounds has been suggested but remains 
largely unsubstantiated to date. A major supposition of 
toxicogenomics is that global changes in the expression of 
individual mRNAs (i.e., the transcriptional responses of cells to 
toxicants) will be sufficiently distinct, robust, and reproducible 
to allow discrimination of toxicants from different classes. 
Definitive demonstration is still lacking for such specific "genetic 
fingerprints, " as opposed to nonspecific general stress responses 
that may be indistinguishable between compounds and therefore not 
suitable as probes of toxic mechanisms. The present studies 
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simultaneously the expression pattern of thousands of genes. It is 
of interest to develop techniques for extracting useful information 
from the resulting data sets. Here we report the application of SPSS 
8.0 software for analyzing a data set consisting of expression 
patterns of 31 multiple myeloma and 7 normal tonsilar derived plasma 
cell samples. RNA was isolated from highly purified cells (>90% 
pure) converted to cRNA and hybridized to the HuGeneFL 
microarray evaluating, in a quantitative fashion, 6, 800 
genes (Affymetrix, Santa Clara CA) . Normalized data was subjected to 
analysis with the t-test or Student t-test to determine whether if 
the groups were different. Two-way clustering algorithm, 
applied to both the genes and abnormal verses normal cell type, 
revealed broad coherent patterns that suggests a high degree of 
organization underlying the gene expression. The clustered gene 
expression data groups were used to identify genes of known or 
similar function, tissue specificity and similar subtypes of 
myeloma. Previous studies have suggested that the transcription 
factor NF-kB regulates the expression of MB-1. A correlation 
analysis to study the relationship of NF-kB and MB-1 expression in 
plasma cells from the 31 MM patients and 7 tonsils resulted in a 
Pearson correlation coefficient value of of 0.27 and 0.25, 
respectively (>0.7 being significant). Theses data indicated that no 
correlation exists between expression of NF-kB and MB-1 in myeloma 
or normal tonsilar plasma cells. It is possible that correlations 
will exists between novel genes and genes known to be regulated by 
NF-kB, a gene important in B-cell development and possibly plasma 
cell neoplasia. Regression analysis making use of the correlation 
among gene expression levels was used to establish a 
prediction equation in which independent variables were each 
assigned a weight based on their relationship to the dependent. An 
automatically derived class predictor was able to 

determine the class of new MM cases. These results demonstrate that 
the SPSS-8.0 software can be used to organize genes into 
biologically relevant clusters based on expression profiles and 
suggest a general strategy for the dissection of the genetic changes 
in myeloma and possible class discovery. 
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